Derivation of the percolation threshold for the network model of Barabasi and Albert 
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The percolation threshold of the network model by Barabasi and Albert (BA-model) [Science 
286, 509 (1999)] has thus far only been 'guessed' based on simulations and comparison with other 
models. Due to the still uncertain influence of correlations, the reference to other models cannot be 
justified. In this paper, we explicitly derive the well-known values for the BA-model. To underline 
the importance of a null model like that of Barabasi and Albert, we close with two basic remarks. 
First, we establish a connection between the abundance of scale-free networks in nature and the 
fact, that power-law tails in the degree distribution result only from (at least asymptotically) linear 
preferential attachment: Only in the case of linear preferential attachment does a minimum of 
topological knowledge about the network suffice for the attachment process. Second, we propose a 
very simple and realistic extension of the BA-model, that accounts for clustering. We discuss the 
influence of clustering on the percolation properties. 

PACS numbers: 89.75.Da 



I. INTRODUCTION 

Scale- free networks, i.e., networks with essentially 
power-law degree distributions, have recently been widely 
studied (see 1] and Q for reviews). Such degree distri- 
butions have been found in many different contexts, for 
example in several technological webs like the Internet 
0, the WWW 3, or electrical power grids, and also in 
social networks, like the network of sexual contacts |5j or 
of phone calls [6j. 

The standard model reproducing scale free degree dis- 
tributions was introduced by Barabasi and Albert (BA- 
model) @ ■ It is based on a growth algorithm with prefer- 
ential attachment. Starting from an arbitrary set of ini- 
tial nodes, at each time step a new node is added to the 
network. This node brings with it m proper links which 
are connected to m nodes already present. The latter 
are chosen according to the preferential attachment pre- 
scription: The probability that a new link connects to 
a certain node is proportional to the degree (number of 
links) of that node. The resulting degree distribution of 
such networks tends to M 



P(k) 



2m(m + 1) 
fc(fc + l)(fc + 2) 



oc k 



(1) 



A second older model which is also widely studied in 
the context of scale-free networks is the configuration 
model (C-model). This model is usually attributed to 
Bollobas and was first treated in a context related to 
percolation by Molloy and Reed ^(j- This is to some 
extent the 'most random' network possessing a given de- 
gree distribution P{k) and a given number N of nodes. 
The building prescription starts with sets of NP(k) nodes 
with k stubs each. The stubs are then connected ran- 
domly to each other; two connected stubs form a link. 
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Double bonds and autoconnections can be neglected in 
the limit of large networks A*" — » oo. 



II. PERCOLATION CONDITION 
A. Theory 

Let us recall the condition for a network at the per- 
colation threshold 0, [^: A node i, linked to a node 
j in the spanning cluster, is connected to exactly one 
other node on average. This results in an average de- 
gree (k) — 2 of the spanning cluster. These properties 
are obviously properties of trees. More precisely, if the 
following conditions are satisfied, a network is at the per- 
colation transition: 

1. There exists a giant cluster, which is a tree (i.e., no 
loops) . 

2. The distance between two randomly chosen nodes is 
almost always infinite (in the thermodynamic limit 
N — > oo). That is, the fraction of pairs, for which 
it is not infinite, is zero. 

We prove this in the following manner: On a tree [con- 
dition (1)] there is always exactly one path between two 
nodes. Now, we randomly delete from the tree a frac- 
tion p of all links or nodes, corresponding to edge or site 
percolation respectively. Due to condition (2), the prob- 
ability that two arbitrarily chosen nodes still belong to 
one cluster is zero independent of p > 0: It is (1 — p) n , 
where n is the distance (number of links) between the 
two nodes, and since the distance diverges for almost all 
pairs of nodes, the probability, that the nodes are con- 
nected is zero. In consequence, there does not exist any 
cluster that consists of a non-zero fraction of nodes, i.e., 
no giant cluster exists. 
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B. Application to the BA-tree 

The BA-network with m = 1 (BA-tree) has two pecu- 
liar qualities: first it is exactly a tree, second it is fully 
connected, i.e., it consists of one single cluster (iff the 
starting network is fully connected - otherwise there are 
as many giant components as there were components in 
the starting network at t = 0). Using the conditions 
introduced above, we now examine, if p c = is the per- 
colation threshold. 

Obviously, condition (1) is fulfilled by the construction 
algorithm. Condition (2) also holds. In [Lj Cohen and 
Havlin calculate a lower limit for the diameter (mean 
distance between all nodes) of scale-free networks. For 
a given degree distribution they build a tree by starting 
with the node of the highest degree as root, and then 
subsequently adding as offspring the nodes with the next- 
highest degrees. When a shell is full, nodes are added to 
the next shell etc. (CH-model) (To the same shell belong 
all nodes with a fixed distance from the root.) Cohen 
and Havlin find that the number of shells diverges in 
the case of the distribution of the BA-model. This of 
course does not immediately entail the divergence of the 
distance between almost all pairs of nodes. 

To prove the latter for the CH-model, it suffices to 
consider pairs including the node with the highest de- 
gree, because it has the maximum number possible of 
neighbors in all shells. Now, from the construction al- 
gorithm of the CH-model as described above we know 
that on every shell there are at least as many nodes than 
on the previous (excluding the last shell, because nodes 
on the penultimate shell can have degree one). This to- 
gether with the fact that the number of shells diverges, 
proves that the distance between the node with the high- 
est degree and almost all other nodes diverges in the CH- 
model. Now, we consider all networks that are trees and 
that have the same degree distribution as the BA-model 
with m = 1. When in any such network we arbitrarily 
choose one node as root and count the number of nodes 
in the shells around it, we find, that there are always less 
nodes in each shell than in the corresponding shell of the 
CH-model, where we take the node with the highest de- 
gree as root (except again the last shell). It follows that 
condition (2) must be fulfilled for the BA-model. 

We have shown, that the percolation threshold for the 
BA-model with m = 1 is p c — 0. Of course, the per- 
colation threshold for a C-network with the same de- 
gree distribution is p c = 1 due to the diverging second 
moment mill. So, for the BA de gree distribution with 
m = I, p c depends strongly on correlations. 

C. Simulations 

The simulations in Fig. ^ confirm the theoretical dis- 
cussion from above. The graph documents the perco- 
lation process for different network sizes from 200 to 
500,000 nodes. We plotted for different p the relative 
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FIG. 1: Link removal in BA-networks with m = 1. The 
scaling of the percolation process with increasing network size 
is examined. The nine curves correspond to p = 0.1-p = 0.9 
in steps of 0.1 from top to bottom. The values for the different 
network sizes were averaged over 9999 runs for 200, 500, and 
1000 nodes, over 500 runs for 5000 nodes, over 100 runs for 
10,000 nodes and 20 runs for both 50,000 and 500,000 nodes. 
The graph suggests that for infinite network size, there will 
be no giant cluster independent of p > 0. 

size s of the giant component depending on the network 
size. It is shown that for a fixed p > the relation be- 
tween the relative size of the giant component s and the 
network size can be described by a simple power-law with 
a negative exponent. Thus, for all p > 0, the relative size 
of the giant component s approaches zero for N — > oo. 



III. THE ROLE OF THE TREE STRUCTURE 

We will now try to determine the origin of the per- 
colation threshold p c — for the BA-tree. As already 
mentioned, this threshold contradicts the common notion 
(proven for the C-model), that for power-law exponents 
7 < 3 the diverging second moment of the degree distri- 
bution yields a percolation threshold p c = 1. Does the 
unusual percolation threshold p c = 1 stem from the pe- 
culiarities of the BA-tree? In the following we examine 
the importance of the tree topology. For this purpose we 
determine the change of the percolation threshold when 
we randomly add a few links to the BA-tree. Therefore, 
we calculate the cluster-size distribution for the BA-tree 
depending on p. 



A. The cluster-size distribution for the BA-tree 

In the BA-model each link has a direction determined 
by the preferential attachment rule: Each node has ex- 
actly m incoming or proper links that are attached to it 
at the moment when the node enters the network. All 
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FIG. 2: An exemplary cluster. Each node has an incoming 
link in the direction of the arrow. All other links are counted 
as outgoing. The dotted lines are the removed links, the solid 
ones are the links drawn. On the far left we see the only 
removed incoming link for the whole cluster. 



other links will be called outgoing links. In the follow- 
ing we consider m = 1, the connected tree. We remove 
a certain fraction p of all links from the network. Then 
the network breaks into as many clusters as links are re- 
moved (plus one). For every cluster one incoming link 
was deleted and a certain number of outgoing links. In 
Fig. |21 an exemplary cluster is shown. There, links enter 
nodes in the direction of the arrow. 

We are interested in the cluster-size distribution in the 
limit of large cluster sizes. First, we count the outgoing 
links of a cluster. L s drawn outgoing links will belong 
to this cluster. In the whole network, only (1 — p) of all 
links are drawn. Then, L u — x-L s deleted outgoing links 
belong to the cluster. In the limit of clusters with many 
nodes, we have x — p/(l — p). 

We now examine the development of a percolated net- 
work, i.e., already at the entry of a node the decision is 
made, if its proper link is drawn. We can formulate a 
dynamical equation for the evolution of the cluster sizes 
with a large number of nodes k: 



d . C(k-l) (fc-2)(2 +T ^) 

—C(k) = (1-pM - 

dt y K I 2N 



c(k) ( fc -i)(2 + T ^; 



2N 



(2) 



C(k) is the number of clusters with k nodes in the net- 
work. At every time-step a new node is added, whose link 
is drawn with probability (1 —p), which is the first fac- 
tor on the right-hand-side of the equation. If the link is 
deleted, automatically a cluster of size 1 is added to the 
network. We neglect this term since we are interested 
only in the limit of large k. The first addend on the 
right-hand-side accounts for the new node being linked 



to a cluster with size k — 1. The number of drawn stubs 
is 2(k — 2), and (k — 2)p/(l — p) is the number of deleted 
(outgoing!) stubs. (A stub is half a link; we neglect the 
single stub of the one deleted incoming link.) The sum 
of both yields the total number of stubs in the cluster 
that determines the linear preferential attachment of a 
new node to the cluster. 2N is approximately the total 
number of stubs in the network with N nodes. The sec- 
ond addend accounts for the new node being linked to a 
cluster of size k. 

We now assume that C(k) = c(k)t = c(k)N with c(k) 
constant for large N, where N is the number of nodes in 
the network. (This follows immediately from the statis- 
tical nature of the networks.) We have 



c(k) = (l-p)<jc(fc-l) 
-c(fc) 



(fc-2)(l + i /( 



2 1 -p 



(3) 



2 1 -p 

We can solve this equation for c(k)/c(k — 1) with k 3> 1: 
c(k) k-2 



c(fc-l) fc-l+ i • 

2 V 

In the limit of large k follows 



(4) 



c{k) oc k 



(5) 



We see that for < p < 1, the exponent lies between 2 
and 3. We checked Eq. J5J in Fig. [SJ where we plotted 
the cluster-size distributions for BA-trees with 100,000 
nodes. For five values between 0.3 and 0.9 we found a 
good correspondence between the simulated curves and 
the theoretical prediction. 



B. Randomly adding links to the BA-tree 

The calculation of the cluster-size-distribution is not 
easily generalized to BA-models with m > 1. We will 
now destroy the tree topology by adding links between 
randomly and uniformly chosen nodes. We will call these 
links R-links. Note, that this does not change the ex- 
ponent of the power-law tail of the degree distribution. 
Does it change the percolation threshold? The new links 
connect the 'old clusters, which are distributed according 
to C(k). The number of R-links added to a cluster is for 
large clusters proportional to the cluster size k, i.e., the 
number of nodes in the cluster. 

Let r be the average number of links added to every 
node. Now, it can be shown that as long as r > a 
spanning cluster exists that comprises a finite fraction 
of all nodes in the network. The argument is analogous 
to the reasoning why there is a spanning cluster in a 
C-network with diverging second moment of the degree 
distribution. When following one of the newly added R- 
links, we find the following probability distribution for 
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FIG. 3: Distribution of cluster-sizes in BA-trees with 100,000 
nodes each. The five curves correspond to p = 0.3, 0.5, 0.7, 
0.8, and 0.9 (from top to bottom, each averaged over 50 runs). 
The dashed lines show the corresponding theoretical results 
for the exponent of the power-law tail, resulting for a double- 
logarithmic plot in a line with slope —1 — 1/(1 — 0.5p). The 
offset for the theoretical curves is adjusted manually. 



encountering a cluster of size k: kC(k) /E fc kC(k)\. For 
large enough k, the total number of R-links connecting 
to such a cluster of size k will approximately be rk. Now, 
when we follow an R-link to a cluster, we can calculate 
the average number of outgoing R-links from this cluster: 

£*;(*- l)C(fc) 

rk EkC(k) • (6) 

k 

If this expression diverges a finite fraction > of all clus- 
ters will be connected after the addition of the R-links 
(for the same reason that in the C-model with diverg- 
ing second moment of the degree distribution a spanning 
cluster develops). To this 'super-cluster' clusters with a 
large number of nodes will belong with a higher prob- 
ability than small clusters and it follows directly that 
the super-cluster also comprises a finite fraction > of 
the nodes in the network, i.e., the super-cluster is also a 
spanning cluster. 

We calculated in Sec. lIII Al that for all p between and 
1 the cluster-size distribution c(k) of the BA-tree has a 
power-law tail with exponent between 2 and 3. Since for 
these c(k) the expression © diverges, there always exists 
a giant component (the super-cluster mentioned above) 
in the BA-tree with additional random links independent 
of r > 0. That leads to a percolation threshold p c = 1 
for the model with additional links opposed to p c = 
for the BA-tree. In this sense the BA-tree has a critical 
topology. 
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FIG. 4: Mapping m = 1 (left side) on m — 2 (right side) 
networks. Pairs of nodes with difference in age of one are 
combined into one single node, a etc. denote here the age of 
the nodes. 



C. Mapping the m = 1 B A- model onto a m = 2 
BA-model 

In this section we prove that the BA-model with m > 1 
has a percolation threshold of p c = 1. There exists an 
easy way of mapping the BA-model with m = 1 onto the 
BA-model with m = 2. The prescription is the following 
(cp. Fig. 0| : 

1. Partition the whole network into pairs of nodes 
with a difference in age of one (the age of a node is a 
natural number representing the moment when the 
node is added to the network, nodes are numbered 
consecutively). 

2. Replace each pair of nodes by a single node. 

With the same effect we can add a 'non-removable' 
link between the two nodes of each pair ('non-removable' 
links are not affected by percolation). With the addition 
of the 'non-removable' links we add to every node of the 
BA-tree one additional link - independent of < p < 1. 
When translating the number of nodes in a cluster of 
the BA-tree with additional links to the number of nodes 
in a cluster of the BA-model with m = 2, we have to 
divide the number of nodes by 2, because the entity of 
two nodes and one 'non-removable' link corresponds to 
one single node in a B A- network with m = 2. 

To apply the considerations of the last section UlI Bl in 
order to prove that p c = 1 for a BA-model with m = 2, we 
still have to show that the 'non-removable' links between 
the nodes are randomly distributed. Actually it suffices 
to consider only large clusters of size k > fco with an 
arbitrary but finite ko- (Those large clusters comprise a 
non-zero fraction of all nodes in the network.) Because: 
if there is a giant component in the subnetwork of those 
large clusters, then there is also a giant component in the 
whole network, since the large clusters already constitute 
a finite part of the network. 

Due to the statistical nature of our networks, we can 
say that in all large clusters we find approximately the 
same age distribution. For this reason we can say that the 
large clusters are randomly linked to each other and that 
the probability for connecting to a cluster j with one 'non- 
removable' link (originating in cluster i) is proportional to 
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the number of nodes in cluster j . Thus, the network of the 
large clusters corresponds to the network-model treated 
in the last section UlI Bl Since in the subnetwork of large 
clusters, the cluster-sizes arc distributed according to a 
power-law with an exponent < 3, there exists a giant 
component in the whole network (as we have proven in 
the previous section). 

Who is not convinced by this statistical argument, con- 
sider a percolated BA-network of size N with m = 1 
and a certain p. In such a network we observe a certain 
cluster-size distribution according to Eq. (JSJ ■ We now let 
the network develop to size 1.5N with m = 2, i.e., from 
now on every new node shall have two proper links. If 
both proper links are drawn [probability (1 — p) 2 }, then 
the new node will link two clusters in the original net- 
work of size N with a probability larger than 1/4 (lower 
limit for the probability that both links connect to the 
original network). Since according to preferential attach- 
ment the new links will connect to existing clusters only 
dependent of the cluster-size, these new nodes will serve 
as bridges between clusters exactly in the way required in 
Section fill Bl Thus, independent of p < 1, BA-networks 
with m = 2 will have a giant cluster, q.e.d. 

This consideration can easily be generalized to BA- 
networks with other m > 1. We can even treat networks 
with fractional (m), where (m) is the average number of 
additional links added per step (with an upper boundary 
mo for the number of links added in each step). For 
(m) > 1 the percolation threshold is always p c = 1. 



IV. A NOTE ON SCALE-FREE NATURE AND 
LINEAR PREFERENTIAL ATTACHMENT 

In the following we will briefly address a few essential 
aspects of the B A- model. These underline the impor- 
tance of this model as a null model. First, we will es- 
tablish a connection between the abundance of scale-free 
networks in nature and the fact, that scale- free degree 
distributions emerge only for asymptotically linear pref- 
erential attachment. For linear preferential attachment 
only minimal topological knowledge about the network 
is required for attaching a new node to the network. 

It is well known that non-linear preferential attach- 
ment results in degree distributions, that are not scale- 
free [l4|. In the sublinear case the degree distribution 
is a stretched exponential. In the asymptotically linear 
case, the degree distribution follows a power-law asymp- 
totic behavior for large degrees k. In the super-linear 
case, a 'winner takes all' phenomenon arises, i.e., a single 
dominant gel node emerges. 

We now show, why linear preferential attachment is in- 
dispensable for an economy in the information required 
for the attachment of a new node. Only for linear prefer- 
ential attachment can the new node connect in a hierar- 
chical process, subsequently increasing the resolution of 
only the portion of the network that the node will attach 
to. 




X -f 

FIG. 5: The component transformation. Arbitrary nodes are 
grouped together to form a single node. All outgoing edges 
are preserved in the process. Edges connecting nodes within 
one group are converted into autoconnections. 

Consider somebody who wants to publish a new web- 
page about a certain topic, e.g. India. To which already 
existing pages does he link his page? There are mil- 
lions of webpages about India, that can be categorized 
into several topics. Our India-fanatic chooses a few of 
those topics (first level) : For example, he wants to make 
a link to a travel agency that organizes tours through 
India. Then, he chooses one of the agencies according 
to their whole Internet representation, combining sev- 
eral webpages (second level). Following his choice, he 
will look in detail at the web-pages of the agency, and 
will then decide to which page exactly he makes his link 
(third level). We will find in the following that this hier- 
archical process of decision-making is only possible with 
linear preferential attachment. 

We introduce the component transformation that lets 
us combine arbitrary groups of nodes to hyper-nodes. 
The component transformation allows to look at the net- 
work at different resolutions. At lower resolutions, the 
amount of information necessary for the characterization 
of the network is smaller. The component transformation 
allows that a new node attaching to the network can do 
so in a hierarchical process, consecutively increasing the 
resolution of only the part of the network, it attaches 
to. We find that this hierarchical connection procedure, 
which follows the same preferential attachment rule at 
every resolution, is only possible for networks with linear 
preferential attachment. 

We define the component transformation as shown in 
Fig. 

1. Nodes are arbitrarily grouped into neighborhoods, 
so that each node can be assigned to exactly one 
neighborhood. 

2. Each neighborhood is replaced by a hyper-node. 

3. Links between nodes of different neighborhoods 
are converted into links between the corresponding 
hyper-nodes. 

4. Each link between nodes of the same neighborhood 
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is converted into an autoconnection of the corre- 
sponding hypcr-nodc. 

Using the component transformation we can consider 
each network at many different levels/resolutions. Af- 
ter a component transformation the network can again 
be subjected to another component transformation. In 
each transformation, information is lost. However, for 
a new node i this information is irrelevant for the deci- 
sion, to which hyper-node j it will connect according to 
preferential attachment. 

Then, to attach node i to the network it suffices to 
look at the fine structure only of the hyper- node j, while 
the fine structure of all other hyper-nodes is irrelevant 
for the attachment process. Finally, the fine structure of 
the hyper-node can again have a superfine structure etc. 

We now prove, that this hierarchical process is only 
possible for linear preferential attachment 



n 

Here, the probability n^, that a new node attaches to 
the old node i, is proportional to the degree of that node 
fcj. The component transformation is ^ 7 i i— > I, i.e., the 
nodes i are combined to groups Ii, which are replaced by 
the hyper-node /. We also have fcj i— > fcj, i.e., the 
degree of the hyper-node fcj equals the sum of degrees of 
all nodes in Ii. Now, with linear preferential attachment 
the probability, that a new node attaches to /, is 

/ I h 

So, the probability that a node attaches to a group of 
nodes Ii is equal to the sum of the probabilities that it 
attaches to one of the nodes in the group. We stress, that 
this does not work anymore with non-linear preferential 
attachment: 

/ i ii 

So, we have shown, that linear preferential attachment 
assures that a new node does not have to know the fine 
structure of the whole network in order to 'decide', which 
part it will attach to. The new node does not need to 
know the topology of the whole network, and it does not 
need to know at which resolution it is looking at the net- 
work. This scale-independent quality of the attachment 
procedure only works for linear preferential attachment 
and is lost for non-linear preferential attachment. We 
have shown that requiring a minimization of the amount 
of information available to the new node results in linear 
preferential attachment, and thus in a scale- free degree 
distribution. These considerations can easily be general- 
ized to linear preferential attachment with initial attrac- 
tiveness Ai 7^ of node i. The hyper-node / then simply 



has the attractiveness A] = J^i ^-i- An essential qual- 
ity for the preferential attachment seems to be, that the 
value/worth attributed to a larger portion of a network 
is equivalent to the sum of the values of its parts. 

V. CLUSTERING AND THE BA-MODEL 

The most problematic aspect of the BA-model is the 
lack of clustering, which stands in harsh contrast to ob- 
servations in real networks. We will in the following pro- 
pose a very simple extension of the BA-model, that allows 
to implement a wide range of clustering, while it exactly 
preserves the degree distribution of the BA-model. 

We assume that every new node added to the network 
brings with it m = 2 proper links. These proper links 
connect the new node with nodes in the network accord- 
ing to different criteria. For example, in a friendship- 
network, every individual would have the right to make 
two friends. The first friend he chooses from people who 
do the same job as he. The second friend he chooses from 
people who have the same favorite hobby. Both times he 
preferably befriends those people that already have a lot 
of friends (i.e., preferential attachment). 

The new feature compared with the BA-model is, that 
at its introduction we assign to each node i two parame- 
ters, a job-parameter < pjj < 1 and a hobby-parameter 
< Ph,i < 1- Each new node has a job-link and a hobby- 
link. Now, according to preferential attachment we first 
determine the degree k x of node x, that the job-link will 
attach to. Then we search for that node x with degree 
k x that has the pj. x closest to pjj, corresponding to the 
best matching of common interests. The same procedure 
determines node y, that the hobby-link of i attaches to. 
The parameters pj and ph can of course be identified with 
two-dimensional coordinates in a geography. A general- 
ization to more than two parameters is straight-forward. 

Note, that by definition of the evolution-algorithm the 
degree distribution develops in the same way as for BA- 
networks, independent of the clustering effect. This can 
be seen on a step by step basis. Every time a new node 
and its two proper links are added, the existing degree 
distribution together with the preferential attachment 
procedure exclusively determines the further evolution 
of the degree distribution. The topology of the network 
plays no role. Thus, the degree distribution of our model 
including clustering will not be different from that of a 
simple BA-model. 

Qualitatively, the clustering depends on the correla- 
tions between the parameters pj.i and ph,i- As a rule, we 
choose pjj uniformly at random. phA is chosen depending 
on the value pjj. There are two limiting cases: 

1. The choice of ph,i is independent from the choice 
of pjj. Then our model corresponds exactly to the 
BA-model with m = 2 and exhibits a very small 
clustering coefficient, that vanishes as A — > oo. 
(The clustering coefficient for a single node i is com- 
monly defined as the number of direct neighbors of 
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i, that are linked with each other, divided by the 
number of possible pairs of direct neighbors of i. 
The clustering coefficient of the whole network is 
the avera ge o f all clustering coefficients for the sin- 
gle nodes [l5j-1 

2- Ph.% — Pj.i — Pi- Then the clustering is maximal. 

For the second case, when the degrees k x and k y of nodes 
x and y are equal, a double bond would be formed. If as 
an additional rule we prohibit double bonds, the second 
link will be connected to a node y with the parameter 
closest to pi but unequal node x. Now, the clustering 
is maximal, because the probability that nodes x and y 
are neighbors is maximal. In the case that x and y are 
neighbors a new triangle is formed in the network. Note, 
that in this case the clustering can be fairly independent 
of the network size. 

Let us finally address the percolation properties of 
these clustered networks. It is clear, that for strong cor- 
relations between ph and pj as in the second case the ar- 
gument presented in Section IlII CI is not valid anymore. 
The reason for this is, that the additional undeletable 
links tend to link nodes in the same cluster. Mapped on 
a B A- model with m = 1, job- and hobby- nodes would 
be added alternately to the network. Subsequent nodes 
would pairwise have the same parameter pi, and would 
consequently be linked to the same neighborhood with 
a high probability. So, when we reconstruct the m = 2 
model, in many cases nodes would be combined, that 
already belong to the same neighborhood. In case of 
strong correlation between the hobby and job parame- 
ters, neighborhoods form with similar job- and hobby- 
parameters. However, as pointed out in Section IIII Bl 
the peculiar percolation behavior p c — 1 will result, if 
just a few small- world links are added: For example if an 



arbitrarily small but finite fraction of hobby-parameters 
are chosen totally independent of the job-parameter, the 
percolation threshold p c — 1 would be recovered. 



VI. CONCLUSION 

We have proven, that the percolation threshold of the 
Barabasi- Albert model is in fact p c = 1, except in the 
case m = 1, when only one link is added for each new 
node. To our knowledge this was shown here for the first 
time in the physics literature. As pointed out by the 
anonymous referee, in 16] Bollobas and Riordan also 
briefly address, how from their derivation of the diameter 
of BA-networks follows the value for the percolation 
threshold of BA-networks with m > 2. Apart from 
that, thus far the value of the percolation threshold 
was based only on simulations and comparison with the 
configuration model. It is worth noting that the result 
achieved here is perhaps not as evident as it may seem. 
Due to the uncertain role of correlations, the BA-model 
and the configuration model are not equivalent in the 
percolation properties as can be seen from the case of 
the BA-model with m = 1. We closed with remarks 
underlining the importance of the BA-model as a null 
model and how clustering can be naturally accounted for. 
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